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Abstract 

In this paper we present a decoupling inequality that shows that multivariate U- 
statistics can be studied as sums of (conditionally) independent random variables. This 
result has important implications in several areas of probability and statistics including 
the study random graphs and multiple stochastic integration. More precisely, we get the 
following result: 

Theorem 1. Let {Xj} be a sequence of independent random variables in a measurable 

space (S, S), and let {X^ }, j = 1, k be k independent copies of {Xi}. Let fi x i 2 ...i k be 
families of functions of k variables taking [S x ... x S) into a Banach space (B, \ \ ■ | |). Then, 
for all n > k > 2, t > 0, there exist numerical constants Ck depending on k only so that, 

P{\\ E fix^S\x^\...,x^)\\>t) 

l<i 1 =£i 2 =£...=£i k <n 

<C k P(C k \\ J2 fn...i h {xll\x\l\...,X^)\\>t). 

l<i 1 ^i 2 ^...^i k <n 

The reverse bound holds if in addition, the following symmetry condition holds almost 
surely 

fi\i 2 ...i k \ X%\ 5 X-i 2 , Xi k ) V(2) •••V(fe) (^*7r(i) ' ^«7r(2) ' ^V(fc))' 

for all permutations % of (1, k). 

1. Introduction 

In this paper we provide the multivariate extension of the tail probability decoupling 
inequality for generalized U-statistics of order two and quadratic forms presented in de la 
Pena and Montgomery- Smith (1993). This type of inequality permits the transfer of some 
results for sums of independent random variables to the case of U-statistics. Our work 
builds mainly on recent work of Kwapien and Woyczynski (1992) as well as on results for 
U-statistics from Gine and Zinn (1992) and papers dealing with inequalities for multilinear 
forms of symmetric and hypercontractive random variables in de la Pena, Montgomery- 
Smith and Szulga (1992), and de la Pena (1992). It is to be remarked that the decoupling 
inequalities for multilinear forms introduced in McConnell and Taqqu (1986) provided us 
with our first exposure to this decoupling problem. For a more expanded list of references 
on the subject see, for example, Kwapien and Woyczynski (1992). 
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2. Main Result 

Theorem 1. Let {Xi} be a sequence of independent random variables in a measurable 
space («S, S), and let {X^ }, j = 1, k be k independent copies of {Xi}. Let fi 1 i 2 ...i k be 
families of functions of k variables taking (S x ... x S) into a Banach space (B, 1 1 • 1 1). Then, 
for all n > k > 2, t > 0, there exist numerical constants Cfc, Cjt depending on only so 
that, 

^(ii E /*i...i fc (< 1) ,< 1) »-,< 1) )ii>*) 

l<i 1 ^i 2 ^...^i k <n 

<C k P(C k \\ £ /i 1 ...i fc (^ 1 1) ,^ a 2) JT^H >*)- 

l<i 1 ^i 2 ^...^i k <n 

If in addition, the following symmetry condition holds almost surely 

fi 1 i 2 ...ik(.Xi 1 , Xi 2 , ^Q fc ) = 

/ *7r(l)»7r(2)---*7r(fe) (^tt(I) > ^tt(2) > •"> ^7r(fe)) 

for all permutations 7r of (1, k), then 

^(11 E ^....(xW^V.xJflllM) 

<c fc p(c*|| E /ii-i*^.^.-.^)!!^*)- 

l<i 1 ^i 2 ^...^i k <n 

Note: In this paper we use the notation {i± 7^2 7^ ••• 7^ ik} to denote that all of i±, ...,ik 
are different. 

3. Preliminary Results 

Throughout this paper we will be using two results found in earlier work. The first 
one comes from de la Pena and Montgomery-Smith (1993). For completeness we reproduce 
the proof here. 

Lemma 1. Let X, Y be two i.i.d. random variables. Then 

2/ 

(i) p(||x|| >*) <3P(||x + y|| > -). 
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Proof: Let X, Y, Z be i.i.d. random variables. Then 



P(\\X\\>t) 

= P(\\(X + Y) + (X + Z)-(Y + Z)\\ >2t) 

< P(\\X + Y\\> 2t/3) + P(\\X + Z\\> 2t/3) + P(\\Y + Z\\ > 2t/3) 
= 3P(\\X + Y\\ > 2t/3). 

The second result comes from Kwapien and Woyczynski (1992) and can also be found 
in de la Pena and Montgomery-Smith (1993). 



Proposition 1. Let Y be any mean zero random variable with values in a Banach space 
(B, || • ||). Then, for all aeS, 

(2) P(||a + y||>||a||)>|, 

where k = mf x > e B' ^e(x>(P)) 2 • (Here B' denotes the family of linear functionals on B.) 
Proof: Note first that if £ is a random variable for which E£ = 0, then P(£ > 0) > 

I £(g 2 ) ' From tlh s ' we deduce that P(x'(Y) > 0) > | _Epf'(y))2 The resu lt then follows, 
because if x'eB' is such that = 1 and x'(a) = ||a||, then {||a + Y|| > ||a||} contains 
{x'(a + Y) > x'(a)} = {x'(Y) > 0}. 

Lemma 2. Let x, a^, a^^, di- l i 2 ...i k belong to a Banach space (B, \\ ■ ||). Let {q} be a 
sequence of symmetric Bernoulli random variables. Then, 



k 

P (W X + Y1 Yl aii...i r eii-etj| > INI) > Cfc 1 , 

r—l l<i i ^i 2 =£...^i r <n 



for a universal constant 1 < Ck < oo depending on k only. 

Proof: Suppose that x, a^, a^^, a% x i 2 ...i k are in P, then since the e's are hypercontrac- 
tive, by equation (1.4) of Kwapien and Szulga (1991) and the easy argument of the proof 
of Lemma 3 in de la Pena and Montgomery- Smith (1993), for some a > 0, we get 

(-^1 S r =l Sl<ii^...5*i r <n a *i •••V e ii ••• e i r | 4 ) 4 
= (-^1 E f =l Sl<i 1 <...<i r <n &ii...ir e ii-" e v| 4 ) 4 
< a fc (P| Xlr=l Xa<i 1 <...<i fc <n &ii...v e ii-" e v| 2 ) 2 
= a (P| X] r =i Ei<i 1 ^...^i fe <n aii...i r eii---ev| 2 ) 2 , 

where 6^...^ = J^res a Mi)---Mr)' an< ^ ^ denotes the set of all permutations of {1, r}. 

Next, observe that ||f || 4 < o-- 2 ||£|| 2 implies that ||f || 2 < cr- 4 ||£||i. Take x'eB' so that 
\\x'\\ = 1 and x'(x) = \ \x\\, then 

P(\\ X + Er=l J2l<i i ^i 2 ^...^i r <n a h---ir e il--- e i r \\ > IMI) 

> P(x'(x) + Y%=i Ei<^i^...^ r <n x'{a il ... ir )e il ...e ir > x'{x)) 

= P(52r=l El<i l ^2^.../i r <n x'(o>i 1 ...i r )€i 1 —€i r > 0) > C fc 

Note: Throughout this paper we will use Cfc and Ck to denote numerical constants that 
depend on k only and may change from application to application. 

4. Proof of the Upper Bound: 
Our proof of this result is obtained by applying the argument used in the proof of the 
upper bound in the bivariate case plus an inductive argument. Let {ai} be a sequence of 
independent symmetric Bernoulli random variables, P((Ji = 1) = \ and P(o"i = —1) = \. 

Consider random variables (zf^zf ) such that (zf } ,zf } ) = (X^\x^ 2) ) if a t = 1 and 
(zf \ Zf } ) = (Xf } ) if <t< = -1. Then (1 + a t ) and (1 - a,) are either or 2 and 
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these random variables can be used to transform the problem from one involving X's to 
one involving Z's. Let us first illustrate the argument in the case that k = 3. 

{(1 + 0(1 +0(1 + 0/iifeU^*£ ) >*£ ) 



+ (l + <7. 



+ (l + f7, 



(3) 



+ (l-<7 



+ (l+<7. 



+ (l-a. 



+ (l-a 



(l + a i2 )(l - a i3 )f ili2i3 (x£\x£\x^ 



(1 - a ia )(l + a l3 )f lll2l3 (X^,X^,X^ 



(l + a i2 )(l + a i3 )f ili2i3 (X^\x^\x^ 



(1 _ °"i2)(l _ a iz)fili2i3( X i{\ X i 2 \ 



(1 + 0(1 - a,)/^^,^,^ 



(1 - ff4a )(l + 0/ii* a ia(^*iM, 2) 



+ (1 - ffil )(l - a i2 )(l - a i3 )f ili2i3 (X%\ X\l\ 

where the sign "+" is chosen if the superscript of X^ agrees with that of Z t , and 
otherwise. Next, set T n>3 = ^ 

l<ii/j 2 /j 3 <n 

if fyW yC 1 ) y( 2 h + f (Y^ y (1) y (1 h 

iAi^is^i! ' A i 2 > A i 3 J +/iii2i 3 l A i 1 ' A i 2 ' A i 3 J 

+ f fyW y( 2 ) y( 2 h -1- f ry( 2) yW y( 2 h 

+/iii 2 i 3 l A i 1 > A i 2 ' A i 3 J +/iii 2 i 3 l A i 1 > A i 2 ) A i 3 ) 

-Lf (Y (1) Y {2) Y (1 h+f (Y^ Y^ Y (1 h 
+Jili 2 i3{ Ji i 1 ; A i 2 ; A i 3 ) + 7*1*2*3 I A ii > A i 2 > A i 3 J 

+/*i*2* 3 l A i 1 ; A i 2 ; A i 3 J + AiMa^u ; A i 2 ; A i 3 Ji- 



Letting g 2 = <t(X^\ xf \ i = 1, n) we 



get 



T„, 3 = 2 s 



i?(/, 1 , 2 , 3 (4 1) ,4 1) ,< 2) )i^ 2 ). 



More generally. 



l<il/i 2 /i 3 <r 

r , for any 1 < Zi, h < 2, one can obtain the expansion 
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ofc f (7(1-1) y( l k)\ 

* Ji 1 ...i k (^i 1 , ) 

(4) = £ (l±a 4l )...(l±a ifc )/ 4l ... 4fc (X 4 ? l) ,...,xg* ) ). 

l<Jiv,Jfe<2 

The appropriate extension of T n>3 is 

i n,* - 2^ Jii...ifel A i 1 '•••' A i fe J- 

l<i 1 =£...^i k <n l<j'i,---,Jfe<2 

Again, 

^, fc = 2 fc £ E(f il .. Ak (Z^\...,Z^ ) )\g 2 ). 

l<i 1 ^...^i k <n 

From Lemma 1 we get, 

^(11 E /ii...**^.-.^)!!^*)^ 

3P(3ii E {^...i fe (4 1 ^--4 1) )+^--^(4 2) --4 2) )}ii > 2«) = 

3P(3||T n , fc + E /ii-**^' + /ii...i*(< 2) . -»< 2) ) " > 2t ) < 

l<ii^...^i fc <n 

{3P(3||T n , fc || >£) 

+3P(3|| e E / il ... ifc (^ l) ,-^£ fc) )ii>*)} 

i<ii#.../i fe <n i<j 1 ,...j fc <2, not all j's equal 
<{3P(3||T n , fc ||>t) 

(5) + E C k P(C k \\ E fn..^(X^\...,X^)\\>t)}. 

i<iiv,ife<2, not all j's equal i<u/...#i fe <n 

(Recall that C k , c k are numerical constants that depend on k only and may change from 
application to application.) 

Observe also that using (4) and the fact that the cr's are independent from the X's, 
Lemma 2 with x — T njk gives for any fixed 1 < h, l k < 2, 

(6) P(2 k \\ E fi,-i k {zf:\-,zl k) )W > \\TnA\ 1^2) > c" 1 , 

l<ii^.../i fc <n 

Integrating over {||T nj fc|| > t} and using the fact that {(xf \x\ 2 } ) : i = 1, ...,n} has 
the same joint distribution as {{z\ l \ zf 1 ') : i = 1, ...,n} we obtain that 

(7) P(2 fc || E / ll ..., fe (< M ,-,^ fc) )||>t). 

5 



= P(2 k \\ f^..iM:\ -,zl k) )\\ >t)> c^P(\\T n , k \\ > t) 

l<i 1 =£...^i k <n 

It is obvious that the upper bound decoupling inequality holds for the case of U- 
statistics of order 1. Assume that it holds for U-statistics of orders 2, — 1. Putting 
(5) and (7) together with 1 < li, l k < 2, not all Z's equal we get, 

^(IIEi^^i^n/ii...**^.--.^)!!^*) <{3^(3||T n , fc ||>t) 

+ E C k P(C k \\ £ f n ... lk (X^\...,X^)\\>t)} 

i<iiv,ifc<2, not all j's equal i<i 1 ^...^i k <n 

< E C k P(C k \\ J2 fn.., k (X^\...,X^)\\>t) 
i<ii,---,ifc<2, not all j's equal i<i 1 ^...^i k <n 

< C k P(C k \\ J2l<i^. ..^i k <n fii-ik( X il i — i X i^)\\ > *)> 

where again, the last line follows by the decoupling result for U-statistics of orders 2, k— 1 
of the inductive hypothesis. Since the statement "not all j's equal" means that there are 
less than k j's equal, the variables whose j's are equal can be decoupled using (conditionally 
on the other variables) the decoupling inequalities for U-statistics of order 2, k — 1. 
Next we give the proof of the lower bound. 

5. Proof of the Lower Bound 

In order to show the lower bound we require the following result. 
Lemma 3. Let 1 < I < k. Then there is a constant C k such that 

p(\\ E / ll ... fe (< 1) ,< l) ,...,x i ( fe l) )n>t) 
> c^nw E E fn-i k ( x t^ x t \-,xH h > c k t). 

l<ii#i 2 #.../ifc<n l<ji,--.,jk<l 

Proof: Let {5 r }, r = 1, /, be a sequence of random variables for which P(5 r = 1) = j 

and P(5 r = 0) = 1 — j, and Y^ r =i &r = 1- Set e r = S r — j for r = 1, /. Then, it is easy 
to see that there exists oi > depending only upon / such that for any real number xq 
and any sequence of real constants {a^} 

(8) \\xq + E a ^ e T-ll4 < \\%o + c^ -1 E arer ll 2 ' 

One can also use the results of Section 6.9 of Kwapien and Woyczynski (1992) (Pg. 180, 
181) to assert this since the e's satisfy the conditions 1. through 3. stated there. 
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Let {(Sn, Su), i = l,...,n} be n independent copies of (Si,..., Si). As before, we 
define 



(9) €ij — Sij 



Since the vectors Si = (en, ...,eu) are independent, by an argument given in Kwapien and 
Szulga (1991), for % = 1, ...,n, for all constants xo,ciij in R, 

n l n I n I 

(10) \\x + ^^a ir e ir || 4 < \\x + cr z~ 1 $^S a%rtir ' ' 2 - 1 l x o + ^ S airQr ' ' 2 ' 

i=l r=l i=l r=l i=l r=l 

and recentering, we obtain 

n l n I 

(11) \\x + ^2^2a ir S ir \\ 4 < af 1 \\x + ^2^2 

i=l r=l i=l r=l 

Next we use the sequence Si,i = l,...,n in defining the analogue of the Z's used in 
our proof of the upper bound. 

For each i, let Zi = X^' if 8^ = 1. Then, {Zi, i = l,...,n} has the same joint 
distribution as {X^\ i = 1, ...,n} and 

fi 1 ...i k (Zi 1 , ■-, Zi k ) = Si 1 j 1 ...Si k j k fi 1 ,_i k (X^ \ ...,x^ k ). 

l<3i,h, — ,3k<l 

The fact that ESi r j r = j for all i r , j r gives, 

E(fi 1 ... ik (Z il , Z ik )\Qi) = (-) fc ^ fii...i k ( X ii 1 \---) X il k) )-> 



I 

l<3i,---,3k<l 



where Q x = a((X^ , x\ l) ), i = l,...,n). 
Let 

Un = 12l<i 1 ^i 29 L...^i k <n fhi2---ik > •"> ^fe) 

= ^h3i---^ik3kfh---ik( X ii^'''i X ii^)- 
l<i 1 ^i 2 ^...^i k <n l<ji,...,j k <l 

Let X>j = Since the Vs are independent of the X's, if we let 

9i\---i k \Pi\ i •••■> Di k ) 

= ^2 ^^i3i---^ik3kfii---ik( X ii J ' ' '■> X il )• 
l<3i,---,3k<l 

then, since 

/ ii...ifc (^Ql > •••) -^ifc) — /i(7r(l))---*(7r(fc)) (^tt(I) ' ^Tr(fe))' 
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we have that, 

9ii-..i k (^ii > •••) T^ik) — 9i(-K(i))---i(-K(k)) (^\r(i) » •"' ^V(fc)) - 

Therefore, the two sided decoupling inequality in de la Pena (1992) can be applied and, for 
every convex increasing function $, every (^-measurable function T, and k independent 
copies T>i \ r = 1, k of T>i there exists numerical constants A k , B k so that 

E(<S>(A k \\T + J2 9n...i k (^-^i k )\\M) 

l<i 1 =£i 2 =£...=£i k <n 
l<i 1 ^i 2 ^...=£i k <n 

< E($(B k \\T + J2 9i 1 ...i k (V il ,...,V ik )\\)\g l ). 

This result with (11) shows that conditionally on Q\ 

(12) \\U n - T n ]| 4 < a7 —\\U n - T n || 2 , 
where 

T n = E(u n \g l ) = (]{ Y E fi 1 ..A h (xt\x^\... 1 x^). 

l<i 1 ^i 2 ^...^i k <n l<ji,--.,j k <l 

(See also the proofs of Lemma 2 and Lemma 6.5.1 of Kwapien and Woyczynski (1992)). 
Thus we have that, 

(13) ^(N^ll > ll^lil^) > c^ 1 . 

This follows from the use of (12) and Proposition 1 with a = T n and Y = U n — T n . We 
also use the fact that for any random variable £ and positive constant c, ||£||4 < c||£||2 
implies that ||£||2 < c 2 ||£||i (See also the proof of Lemma 2 for the approach to transfer 
the problem from one on Banach space valued random variables to one on real valued). 
Integrating (13) over the set {||T n || > t} we get 

^(11 J2l<i 1 ^i 2 ^...^i k < n fii—iki^n i^i 2 '•"'^ifc )H— *) 

= P(\\ Y j ix ...i k ^Z^...,Z ik )\\>t) 

l<i 1 ^i 2 =£...=£i k <n 

>c- k l P(C k \\ Y E fn.^(X^\x^\.^X^)\\>^ 

l<i 1 ^i 2 ^...^i k <n l<ji,...,j k <l 

and Lemma 3 is proved. 
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The end of the proof of the lower bound follows by using induction and the iterative 
procedure introduced to obtain the proof of the lower bound multivariate decoupling in- 
equality in de la Pena (1992). We give a different expression of the same proof, motivated 
by ideas from de la Peha, Montgomery-Smith and Szulga (1992). We will use Sk to denote 
the set of permutations of {1, k}. 

The Mazur-Orlicz formula tells us that for any 1 < ji,...,jk < k that 

0<5i,...,<5 fe <l 

is unless j\, ...,jk is a permutation of 1, k, in which case it is 1. Hence 

Ef /-yWl)) y(w(*0)\ 

Jn...ik K-^i! i • • • i I 

= E (-I)*"* 1 --* E 8 h ...8 jk fn...i k {X^\...,X^). 

0<<5i,...,5 fe <l l<3i,-~,3k<k 

By the symmetry properties on /, 

/ , Jl^...l k \^i 1 T--1^i k ) 

l<i 1 ^...^i k <n 

= ^ E E {-i) k - s *-- s > E s h ..ju^d^\...,x^). 

l<i 1 jL...jLi k <n 0<<5i,...,<5 fe <l l<j 1 ,...,j k <k 

Therefore, 

Pr(|| E fi 1 ...i k {x\l\...,X^)\\>t) 

l<i 1 =£...=£i k <n 

< E Pr (n E E ^•••^/«i...i fc (^ l) »---»^ fc) )ii>*«*/2*) 

0<5i,...,5fc<l l<i 1 =£...=£i k <n l<j'i,...,Jfe<fc 

= E (/) Pr (n E E Ai...*^ . > ^/2 fc ), 

and this combined with Lemma 3 is sufficient to show the result. 
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